Proof of validity of first-order travel estimates 



Len Bos 

Dipartimento di Informatica, Universita di Verona, Italy 
Michael A. Slawinski 



Department of Earth Sciences, Memorial University of Newfoundand, St. John's, Newfoundland, Canada 



In the seminal paper by Dahlen et al |1| the authors formulate an important expression as a first-order estimate of 
traveltime delay. The authors left out a term which would at first glance seem nontrivial, on the basis that their intention 
was to derive the Frechet derivative linking the observed delay to the model perturbation (Nolet 2009, pers. comm.). 
Here we show that the derivation by 1 1 1 results in a first-order estimate even without anticipating a Frechet derivative, but 
instead remaining deductively in their Taylor-series formulation. Although a mathematical technicality, this strengthens 
the result of [ 1 1 by showing that it is intrinsically valid, requiring no external justification. We show also that ignoring the 
aforementioned term is not valid in general and needs to be supported by careful argument. 

1 Introduction 

The purpose of this paper is to clarify and strengthen the derivation of formula (65) of the seminal paper by (T], in which 
the use of crosscorrelation to estimate traveltime delays is discussed. Formula (65) is a first-order estimate of St, the shift 
in the peak position of the crosscorrelation of observed and synthetic pulses. At a certain point in the derivation, which 
we explain in detail below, the Taylor series used by [ 1 ] is of one order less than would be normally expected, and for a 
general equation it would not be possible to make the claim that the estimate is indeed of first order. However, we show 
that in this special context the claim is mathematically true for rather subtle reasons, without the need of justifying it by 
a subsequent geophysical application. Even though, certain subtleties might be of no interest to geophysicists using the 
results of Q, we offer this proof in the context of geomathematics and in a spirit of respect for the contributions of the 
late Tony Dahlen. 

Let us describe the derivation in question. The main entity studied is the crosscorrelation 



where s is the synthetic signal and s obs is the observed signal; this is equation (58) of Q]. Subsequently, they assume that 
s obs can k e wr itten as 
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This permits them to write equation (60): 



s(t)+Ss(t), 



r(r) 



7(r) +Sj(t) 



(2) 



1 



where 

7 (r) = / s(t- T)s(t)dt 
ti 

is the crosscorrelation of s with itself, and 

ta 

<5 7 (t) = y s(t - T)Ss(t)dt (3) 

is the crosscorrelation of the synthetic signal and the perturbation. 

The maximum of the autocorrelation, 7 (r), is at t = 0; the goal of JT| is to give a first-order estimate of the delay 
given by the maximum of T(t), which they call St. They obtain this estimate by taking the Taylor series of T(r) about 
t = 0, to the second order, differentiating this second-order series, and taking St to be given by the zero of this derivative. 
Herein is a technical subtlety. The second-order Taylor series should be 

T(r) - 7 (0) + a T7 (0)r + ^ tt7 (0)t 2 + <5 7 (0) + a T 5 7 (0)r + id Tr <h(0)r 2 + • • • , (4) 

so that 

<9 r r(r) = <9 r7 (0) + <9 tt7 (0)t + <9 T 5 7 (0) + <9 tt £ 7 (0)t + ■■■ , 
and the critical point of T(r), using the fact that 9 r7 (0) = 0, would be 

r grgKg) 

~ d TTl (o) + a TT <5 7 (o)- K} 

However, instead of the second-order series Q, [Tl use 

T(r) = 7 (0) + d Tl {i))T + icW 7 (0)T 2 + <5 7 (0) + d T S-/{0)T + • • • , (6) 
which is their equation (63); in other words, they neglect 

id Tr <5 7 (0)r 2 , (7) 

which results in 

d r 6<y(0) 

O T tJ{0) 

which is estimate (65) of (TJ . 

The absence of the term |7]) is not trivial mathematically, even if one might argue for it heuristically in the context of 
geophysical approximations. We have examined the documents of the late Tony Dahlen, which he sketched in preparation 
for (TJ, and which are available at Princeton University. Not having found a mathematical justification therein, we give 
one in Sections [3] and |4] below. Before doing so, however, we give an example that shows that care must be taken when 
using such a method to estimate critical points. 



2 



2 Illustrative Example 

We give an example showing that small perturbations may cause critical points to move great distances. Consider 
where 

a := 1 



(1 + e 2 ) 2 ' 

with 1 > e > 0, which implies that < a < 1. Thus, we can write 



1 + 1 2 v ; 1 + t 2 ' 

and conclude that 

a</0)<l, fet, 
and that /(i) has a unique maximum at t — 0: /(0) = 1. Furthermore, 

which attains its maximum absolute value at t = ±\/3/3: 

in other words, 
Now let 

'<*> = i + { t- h y 

where 6 := 1/e, be a perturbation to /(i). Since 

|ff(t)|<C <et, 

it is a small perturbation. Also, 

^)= 2e ( i + jr-W' 

and we see that g'(t) > for t < 0. Hence, /i(t) := /(i) + g(t) is such that 

h'(t) > for f < 0, 

and the perturbed function, h, has no negative critical points. 
Since for < t < 6/2, 

. //xi e6 1 e 4 

15 {t)] ~ (1 + 6 2 ) 2 = (1 + 1/e 2 ) 2 = (1 + e 2 ) 2 = ~ a ' 
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it follows that 

|^(f)|>l-a>^(l-a)>|/'(t)|, 

and, consequently, 

h'{t) ^ 0, < t < 6/2. 

In other words, any critical points of the perturbed function, hit), must be greater than b/2 = l/(2e), which can be 
arbitrarily large and distant from the original critical point of f(t), which is at t = 0. 



3 First-order estimates 

As illustrated in the above example, it is important to understand what is happening if one estimates a critical point by 
using Taylor series in this way. A critical point, c, of f(t) is a root of its derivative, f'(t). In the expectation that this 
critical point is near t = 0, it is reasonable to estimate /(f) by its Taylor series about f = 0, 

/W = /(0) + /'(0)t+i/"(0)i 2 + -- - . 

Differentiating, we get 

/'(*) = /'(0) + /"(0)t+--- , (10) 



and the estimate of the critical point is obtained by setting expression ( 10 1 to zero and solving for f to get 

c= _m 

f"(py 

which is analogous to expression (65) of (TJ. Notice that this is one iteration, c = t\, of Newton's method for finding a 
root of fit) with to = as starting point. Hence the relevant error estimates are those for Newton's method. 

Theorem 3.1 ([2], p. 56) Suppose that Newton's method is applied to the function F(t) with starting point t to obtain 
the estimate t\ of the true root t — z. Then 

for some 7/ between z and to . 

If | to — z\ is "small" then we may refine this error bound as follows. In this case jjr^zso that 

F"(y) F"jz) 
2{F'{ri)y~ 2( J F'(z)) 3 ' 

which we may regard as approximately constant, provided F'(z) ^ 0. Furthermore, we may expand 

F(to) = F(z) + F'0)(to -«)+••• = + F'(z)(t -*) + ••• 

so that 

if(*o)i«m*)ii*o-*i. 

Hence we arrive at, under suitable technical assumptions, that 

\ti~z\ <C\t -z\ 2 (11) 
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Figure 1 : An example of the Newton estimate being far from a root 



where C is a constant that depends on F, z and t§. Notice that this is a restatement of the fact that Newton's method is 



quadratically convergent. But we must be careful; estimate ( 1 1 1 is local in nature — it depends strongly on the assumptions 



that to is already close to z and that F'(z) is not close to 0. If this is not the case, then there is nothing reasonable to say 
about \t\ — z\. Figure [TJillustrates this problem. 

Returning to our problem, we use F(t) := d T {j(r) + 6-f(r)} with r = 0, so that 

d T S 7 (0) 



T\ = 8t of expression ([5]). Hence we conclude that indeed estimate <|5j does provide a first-order approximation to the 
delay, provided that the true critical point is close to zero and that <9 TT 7(0) + d TT 5"f{Q) is not close to zero. 



4 Conclusion: Why estimate @ is of first order 



We consider the difference between expression d5), which we know to be first-order, and estimate ([8]) of [1|, which we 
wish to show is also first-order. In fact we claim that the difference between these two expressions is of second order, 
from which our claim follows. Specifically, we have 



cU7(0) 



9 T <5 7 (0) 



C> tt7 (0) + OrrSjiO) a Tr7 (o) 



|9 T <5 7 (0)| 

|M7(0)| 



1 



drrl(0) 



- a TT <5 7 (0) 



cW 7 (0) 



a Tr7 (o)(9 TT7 (o) + a Tr <5 7 (o)) 



(12) 
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Following expression ([3]), we write 

|«J 7 (0)| = J s{t)8s(t)dt 
ti 

which we rewrite symbolically as 

1*7(0)1 < Cx\S8\ 
Herein and below, Ct, C%, etc. stand for generic constants. 
Taking the derivative, we have 



< max \5s(t)\ / \s(t)\dt, 



|Sr*7(0)| 

which we rewrite symbolically as 
Differentiating again, we have 



d t s(t)8s(t)dt 



ti 



< max \Ss(t)\ I \d t s(t)\dt, 

t-\ < Ct < Ct2 



\d T S-f(0)\ < C 2 \5s\ 



(13) 
(14) 



< C 4 \Ss\ 



\d TT 6j(0)\ <C 3 \Ss\. 

Using estimates ( 13 1 and ( 14 1 in expression ( |l2| ), we have, for sufficiently small \8s\ and d Tr7 (0) ^= 0, 

a rr7 (o) + a TT <5 7 (o) a rr7 (o) 

which means that the difference between the first-order estimate <j3j and estimate §8§ of [ 1 1 is second-order in 8s. Hence, 
we conclude that estimate ([8} is indeed a first-order estimate. 

To demonstrate that estimate ([8]) is of the first order, we remained in the context of steps (58) to (65) of Q, which are 
expressions ([TJ and ([8]), herein. There is no need to anticipate Frechet derivatives that are to result from this derivation; in 
other words, there is no need to let 8j tend to zero, as suggested by Nolet (2009, pers. comm.). Hence, the formulation 
herein, albeit technical, strengthens the first-order validity of estimate ((SI of (TJ. 
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